A Novel Variable Selection Method based on Frequent Pattern Tree for Real-time Traffic Accident Risk Prediction
نویسندگان
چکیده
Traffic accident data are usually noisy, contain missing values, and heterogeneous. How to select the most important variables to improve real-time traffic accident risk prediction has become a concern of many recent studies. This paper proposes a novel variable selection method based on the Frequent Pattern tree (FP tree) algorithm. First, all the frequent patterns in the traffic accident dataset are discovered. Then for each frequent pattern, a new criterion, called the Relative Object Purity Ratio (ROPR) which we proposed, is calculated. This ROPR is added to the importance score of the variables that differentiates one frequent pattern from the others. To test the proposed method, a dataset was compiled from the traffic accidents records detected by only one detector on interstate highway I-64 in Virginia in 2005. This data set was then linked to other variables such as real-time traffic information and weather conditions. Both the proposed method based on the FP tree algorithm, as well as the widely utilized, random forest method, were then used to identify the important variables or the Virginia data set. The results indicate that there are some differences between the variables deemed important by the FP tree and those selected by the random forest method. Following this, two baseline models (i.e. a nearest neighbor (k-NN) method and a Bayesian network) were developed to predict accident risk based on the variables identified by both the FP tree method and the random forest method. The results show that the models based on the variable selection using the FP tree performed better than those based on the random forest method for several versions of the k-NN and Bayesian network models. The best results were derived from a Bayesian network model using variables from FP tree. That model could predict 61.11% of accidents accurately, while having a false alarm rate of 38.16%.
منابع مشابه
A Model for Predicting Schoolchildren Accidents in the Vicinity of Rural Roads based on Geometric Design and Traffic Conditions
awareness they gain from their surroundings. Recent statistics indicate that about 40 percent of road accident fatalities are pedestrians, 30 percent of which are under 18 years old. Based on the fact that almost two million Iranian students study in the vicinity of rural roads, this paper aims to develop a model for predicting the risk of students’ accidents near the aforementioned schools. Th...
متن کاملآنالیز بقای رانندگان سیستم حمل و نقل جاده ای بر اساس تیپ شخصیتی
Introduction: Traffic accidents are an indispensable part of driving. In Iran because of large number of traffic accidents, several studies performed on the association between traffic accident and various personal and environmental factors Method: This retrospective analytical study was performed during April till August 2016 on 569 drivers, at Yazd city. The participants were professio...
متن کاملA Novel Fuzzy Based Method for Heart Rate Variability Prediction
Abstract In this paper, a novel technique based on fuzzy method is presented for chaotic nonlinear time series prediction. Fuzzy approach with the gradient learning algorithm and methods constitutes the main components of this method. This learning process in this method is similar to conventional gradient descent learning process, except that the input patterns and parameters are stored in mem...
متن کاملTraffic Condition Detection in Freeway by using Autocorrelation of Density and Flow
Traffic conditions vary over time, and therefore, traffic behavior should be modeled as a stochastic process. In this study, a probabilistic approach utilizing Autocorrelation is proposed to model the stochastic variation of traffic conditions, and subsequently, predict the traffic conditions. Using autocorrelation of the time series samples of density and flow which are collected from segments...
متن کاملVehicular Route Prediction In City Environment Based On Statistical Models
Travel route analysis and prediction are essential for the success of many applications in Vehicular Ad-hoc Networks (VANETs). It is challenging to make accurate route prediction for general vehicles in city settings due to several practical issues such as very complicated traffic networks, the highly dynamic real-time traffic conditions and their interaction with driver’s route selections. The...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1701.05691 شماره
صفحات -
تاریخ انتشار 2014